A DCRNN-based ensemble classifier for speech emotion recognition in Odia language
نویسندگان
چکیده
Abstract The Odia language is an old Eastern Indo-Aryan language, spoken by 46.8 million people across India. We have designed ensemble classifier using Deep Convolutional Recurrent Neural Network for Speech Emotion Recognition (SER). This study presents a new approach SER tasks motivated recent research on speech emotion recognition. Initially, we extract utterance-level log Mel-spectrograms and their first second derivative (Static, Delta, Delta-delta), represented as 3-D Mel-spectrograms. utilize deep convolutional neural networks to the features from Then bi-directional-gated recurrent unit network applied express long-term temporal dependency out of all produce emotion. Finally, use classifiers Softmax Support Vector Machine improve final recognition rate. In this way, our proposed framework trained tested (Seven emotional states) RAVDESS (Eight dataset. experimental results reveal that performs better instead single classifier. accuracy levels reached are 85.31% 77.54%, outperforming some state-of-the-art frameworks datasets.
منابع مشابه
Classifier Ensemble Framework: a Diversity Based Approach
Pattern recognition systems are widely used in a host of different fields. Due to some reasons such as lack of knowledge about a method based on which the best classifier is detected for any arbitrary problem, and thanks to significant improvement in accuracy, researchers turn to ensemble methods in almost every task of pattern recognition. Classification as a major task in pattern recognition,...
متن کاملemotional speech recognition and emotion identification in farsi language
speech emotion can add more information to speech in comparison to available textual information. however, it will also lead to some problems in speech recognition process. in a previous study, we depicted the substantial changes of speech parameters caused by speech emotion. therefore, in order to improve emotional speech recognition rate, in a first step, the effects of emotion on speech par...
متن کاملSpeech Emotion Recognition With TGI+.2 Classifier
We have adapted a classification approach coming from optical character recognition research to the task of speech emotion recognition. The classification approach enjoys the representational power of a syntactic method and efficiency of statistical classification. The syntactic part implements a tree grammar inference algorithm. We have extended this part of the algorithm with various edit cos...
متن کاملSpeech Emotion Recognition Using Scalogram Based Deep Structure
Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...
متن کاملA Novel Classifier Based on Enhanced Lipschitz Embedding for Speech Emotion Recognition
The paper proposes a novel classifier namedELEC (Enhanced Lipschitz Embedding based Classifier) in the speech emotion recognition system. ELEC adopts geodesic distance to preserve the intrinsic geometry of speech corpus and embeds the high dimensional feature vector into a lower space. Through analyzing the class labels of the neighbor training vectors in the compressed space, ELEC classifies t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Complex & Intelligent Systems
سال: 2022
ISSN: ['2198-6053', '2199-4536']
DOI: https://doi.org/10.1007/s40747-022-00713-w